Statistical Conversion Algorithms of Pitch Contours Based on Prosodic Phrases
نویسندگان
چکیده
Pitch contour of a speech utterance plays an important role in expressing speaker's individuality and meaning of the utterance. In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speaker’s utterance be converted into that of the target speaker. This paper investigates statistical algorithms of pitch contour conversion for Korean language. The algorithms are based on Gaussian normalization, and its combination with a declination-line modeling of pitch contour. Pitch contour conversion are investigated at two levels of prosodic phrases: intonation phrase and accentual phrase. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the algorithms for intonational phrases in pitch contour conversion.
منابع مشابه
A Strategy for Pitch Conversion and Its Evaluation
In order to transform the perceived speaker identity, a voice conversion system should, a.o., convert the speaker’s prosodic characteristics. When considering pitch contours, most systems only transform the pitch by simple scaling. A stochastic system that transforms pitch contours taking into account multiple pitch parameters has been developed and is described. A pitch transplantation system ...
متن کاملProsody Annotation for Unit Selection Tts Synthesis
This paper concerns prosody annotation and intonation modeling, especially for the application in a corpus based speech synthesis. In order to establish the rules of the automatic intonation modeling, a four hour fully annotated speech database has been acoustically and perceptually analyzed. The speech material included different text types, dialogs and prosodically rich phrases. As the result...
متن کاملProsody annotation for corpus based speech synthesis
The paper concerns prosody annotation especially for application in a corpus based speech synthesis. In order to establish the rules of automatic intonation modelling, phonetically labeled speech database of 4 hours has been perceptually and acoustically analyzed. The speech material included different text types and prosodically rich phrases. The annotation of the speech database consists in p...
متن کاملProsodic phrase segmentation by pitch pattern clustering
This paper proposes a novel method for detect,ing the optimal sequence of prosodic phrases from continuous speech based on data-driven approach. The pitch pattern of input speech is divided into prosodic segments which minimized the overall distortion with pitch pattern templates of accent phrases by using the One Pass search algorithm. The pitch pattern templates are designed by clustering a l...
متن کاملStatistical prosodic modeling: from corpus design to parameter estimation
The increasing availability of carefully designed and collected speech corpora opens up new possibilities for the statistical estimation of formal multivariate prosodic models. At Apple Computer, statistical prosodic modeling exploits the Victoria corpus, recently created to broadly support ongoing speech synthesis research and development. This corpus is composed of five constituent parts, eac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003